Hierarchical Part-Based Human Body Pose Estimation
نویسندگان
چکیده
This paper addresses the problem of automatic detection and recovery of three-dimensional human body pose from monocular video sequences for HCI applications. We propose a new hierarchical part-based pose estimation method for the upper-body that efficiently searches the high dimensional articulation space. The body is treated as a collection of parts linked in a kinematic structure. Search for configurations of this collection is commenced from the most reliably detectable part. The rest of the parts are searched based on the detected locations of this anchor as they all are kinematically linked. Each part is represented by a set of 2D templates created from a 3D model, hence inherently encoding the 3D joint angles. The tree data structure is exploited to efficiently search through these templates. Multiple hypotheses are computed for each frame. By modelling these with a HMM, temporal coherence of body motion is exploited to find a smooth trajectory of articulation between frames using a modified Viterbi algorithm. Experimental results show that the proposed technique produces good estimates of the human 3D pose on a range of test videos in a cluttered environment.
منابع مشابه
تخمین چنددوربینی حالت سه بعدی انسان با برازش افکنش مدل اسکلت سه بعدی مفصل دار در تصاویر سایه نما
Automatic capture and analysis of human motion, based on images or video is important issue in computer vision due to the vast number of applications in animation, surveillance, biomechanics, Human Computer Interaction, entertainment and game industry. In these applications, it is clear that 3D human pose estimation is an essential part. Therefore, its accuracy has a great effect on the perform...
متن کاملAttribute And-Or Grammar for Joint Parsing of Human Attributes, Part and Pose
This paper presents an attribute and-or grammar (A-AOG) model for jointly inferring human body pose and human attributes in a parse graph with attributes augmented to nodes in the hierarchical representation. In contrast to other popular methods in the current literature that train separate classifiers for poses and individual attributes, our method explicitly represents the decomposition and a...
متن کاملAn Augmented Reality Based User Interface Model for Full Body Interaction using Monocular 3D Pose Estimation
In this paper, we present a conceptual design of augmented reality full body interaction based on monocular 3D pose estimation. The proposed design is based on 3D pose estimation from the image of user’s motions captured by a monocular camera and the processing of 3D human poses for augmented reality applications. Based on the method, a 3D human full body model is constructed. The silhouettes e...
متن کاملHierarchical Approach for Articulated 3D Pose-Estimation and Tracking
In the recent years we presented a number of methods for a fully automatic pose estimation [5, 7] and tracking [6] of human bodies in 2D [5] and 3D [6]. Initialization and failure recovery in these methods are facilitated by the use of loose-limbed body model [7] in which limbs are connected via learned probabilistic constraints. The pose estimation and tracking can then be formulated as an inf...
متن کاملA Framework for Human Pose Estimation in Videos
In this paper, we present a method to estimate a sequence of human poses in unconstrained videos. We aim to demonstrate that by using temporal information, the human pose estimation results can be improved over image based pose estimation methods. In contrast to the commonly employed graph optimization formulation, which is NP-hard and needs approximate solutions, we formulate this problem into...
متن کامل